A novel method for interrogating receiver operating characteristic curves for assessing prognostic tests
نویسندگان
چکیده
Background: Disease prevalence is rarely explicitly considered in the early stages of the development of novel prognostic tests. Rather, researchers use the area under the receiver operating characteristic (AUROC) as the key metric to gauge and report predictive performance ability. Because this statistic does not account for disease prevalence, proposed tests may not appropriately address clinical requirements. This ultimately impedes the translation of prognostic tests into clinical practice. Methods: A method to express positiveand/or negative predictive value criteria (PPV, NPV) within the ROC space is presented. Equations are derived for so-called equi-PPV (and equi-NPV) lines. Herewith it is possible, for any given prevalence, to plot a series of sensitivity-specificity pairs which meet a specified PPV (or NPV) criterion onto the ROC space. This concept is introduced by firstly reviewing the well-established “mechanics”, strengths and limitations of the ROC analysis in the context of developing prognostic models. Then, the use of PPV (and/or) NPV criteria to augment the ROC analysis is elaborated. Additionally, an interactive web tool was also created to enable people to explore the dynamics of lines of equi-predictive value in function of prevalence. The web tool also allows to gauge what ROC curve shapes best meet specific positive and/or negative predictive value criteria (http://d4ta.link/ppvnpv/). Results: To illustrate the merits and implications of this concept, an example on the prediction of pre-eclampsia risk in low-risk nulliparous pregnancies is elaborated. Conclusions: In risk stratification, the clinical usefulness of a prognostic test can be expressed in positiveand negative predictive value criteria; the development of novel prognostic tests will be facilitated by the possibility to co-visualise such criteria together with ROC curves. To achieve clinically meaningful risk stratification, the development of separate tests to meet either a pre-specified positive value (rule-in) or a negative predictive value (rule-out) criteria should be considered: the characteristics of successful rule-in and rule-out tests may markedly differ.
منابع مشابه
Comparison of correlated receiver operating characteristic curves derived from repeated diagnostic test data.
RATIONAL AND OBJECTIVES It is common to administer the same diagnostic test more than once to the same set of patients. The purpose of this study was to develop two statistical methods for estimating and comparing correlated receiver operating characteristic (ROC) curves for data derived from repeated diagnostic tests. MATERIAL AND METHODS Parametric and semiparametric transformation models w...
متن کاملA marginal model approach for analysis of multi-reader multi-test receiver operating characteristic (ROC) data.
The receiver operating characteristic curve is a popular tool to characterize the capabilities of diagnostic tests with continuous or ordinal responses. One common design for assessing the accuracy of diagnostic tests involves multiple readers and multiple tests, in which all readers read all test results from the same patients. This design is most commonly used in a radiology setting, where th...
متن کاملEvaluating diagnostic tests
Anaesthesiologists are increasingly more involved in perioperative patient care wherein interpretation of special investigations is crucial to making therapeutic and prognostic decisions. Furthermore, anaesthetic journal publications increasingly rely on diagnostic tests, without paying sufficient attention to the methodology for evaluation of the predictive ability of these tests, particularly...
متن کاملLevels of C-reactive protein, creatine kinase-muscle and aldolase A are suitable biomarkers to detect the risk factors for osteoarthritic disorders: A novel diagnostic protocol
Background: C-reactive protein (CRP), creatine kinase-muscle (CK-MM) and aldolase A (AldoA) levels are predicted to be realistic biomarkers of osteoarthritic disorders (OADs). The objective of the study was to evaluate the levels of CRP, CK-MM, and AldoA and determine their correlations with risk factors such as inflammation, muscle degeneration, and skeletal muscle damage for OADs. Methods: B...
متن کاملNovel tests for evaluating two ROC curves under paired samples.
Disease prevention is important and can be accomplished by developing diagnostic tests. The receiver operating characteristic (ROC) curve and the area under the ROC curve (AUC) are used to assess the accuracy of diagnostic tests. The assessment for the superiority between evaluating two diagnostic tests is needed when comparing two diagnostic tests. Existing tests are constructed by comparing t...
متن کامل